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(57) The space of a buffer 201 is logically parti- 
tioned into space reserved for requests only, space 
reserved for responses only, and space that can be 
used for either requests or responses, i.e., dynamically 
usable as needed by the system. An arbiter 301 uses 
three registers to keep track of the request buffer space 
302. the response txjffer space 303 and the dynamic 
space 304. The arbiter compares each of the registers 
with a corresponding limit 307. 308, and 309, to deter- 
mine if a request packet or a response packet should be 
sent to the buffer. The limits are set by software and 
define the maximum number of request products, 
response packets, and total number of pad<ets the 
buffer can hold. For exanple. the limit nrtay be set to 
eight requests, eight responses and ten total. Thus two 
spaces are reserved for requests arKf two spaces are 
r^erved for responses, and six are dynamically usable. 
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Description 

TECHNICAL FIELD OF THE INVENTION 



This invention relates in general to multi-processor 
computer systems and in particular to the allocation of 
queue space between requests and responses. 

BACKGROUND OF THE INVENTION " 

In prior systems, there are a certain number of 
request buffer locations and a certain number of 
response buffer locations. These buffer locations are 
fixed in the hardware and not changeable by software. 
In some systerris, the refquest and response txiffer loca- 
tions are on physically separate hardware devices. 
Thus, the fixed nature causes an inherent inefficiency in 
the ailocation of buffer space. Basically, the problem iis 
that if a large number of request packets were received 
which fill up the buffer additional, request packets would 
have to be halted until the request packet buffer is errtp- 
tied. This filling and emptying of buffers is time consum- 
ing and inefficient. 

A consequence of this problem is that the buffer 
response tirne is not constant there is a possibility that if 
a net pad<et is one of the type that can be sent out and 
used right away, and if there is no buffer space at that 
time, the packiet will be delayed until the buffer becomes 
open. This problem greatly affects performance of the 
system. 

It is therefore desirable to have a system that 
makes more efficierrt use of the available buffer space 
and reduces the probability that a data packet will be 
delayed due to congestion in the queue buffers. , 

It is therefore further desirable to have a system' 
that has more efficient distribution of data packets and 
positively impacts system perforniance, by n^intaining 
open queue buffer space, and thus allowing better 
throughput of data packets. 

SUMMARY OF THE INVENTION 

The invention partitions a single buffer into space 
reserved for requests, only, space reserved for 
responses only, and space that can be used for either 
requests or responses, as needed by the system. Under 
this arrangement, if at any instant one of the reserved 
buffers (requests or responses) goes into overflow, the 
dual use buffers can be used to keep the data flowing 
efficierrtiy. 

The processor agent chip or PAC has three internal 
registers that keep track of the status of the buffer 
space. One counter keeps track of request buffer space, 
a second counter keeps track of response buffer space, 
and the third counter keeps track of the total amount of 
buffer space that is being used. By using the three 
counters and comparing each of ttiem to three limits, 
the system is capak>le of dynamic buffer allocation. The 
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limits are set 'by' software and define the rriaximum 
nurrtber of request packets that can be in the buffer, the 
maximum number ' of response packets that can be in 
the buffer, and' the total number of both request and 
5 response packets that can t^e in the buffer. 

Before the PAC sends a request packet, the PAC 
polls counters to insure that the buffer is not full. The 
PAC checks the request counter to ensure that the 
request limit has not been exceeded, and then the PAC 
10 checks the total counter to ensure this packet will not 
exceed the total limit count for that txjffer. 

For example, assume the limit for the total txiffer 
size is set tor ten packets of both response and request 
types, and the limit for the request count is set for eight 
15 packets of the request type. That would mean that the 
request packet could be sent as long as there was not 
eight request packets currentiy in the request buffer and 
that there was ,not currently ten total packets of both 
- request and response types in the buffer. This basically 
20 means that the request packets are limited to a maxi- 
mum of eight of ten locations. And consequently, the 
otiier two locations are reserved exclusively for the 
response packets. " ■ ^ 

, I Similarly for responses, if the limit for the total buffer 
25 size is set for ten packets of both response and request 
types, and the limit for the response count is set for eight 
packets of the response type. That would mean that the 
response packet could be sent as long as there was not 
eight response packets currentiy in the response buffer ^ 
30 and that there was not currentiy ten total packets of both 
request and response types in tiie buffer. This basically 
, means that the response packets are limited to a maxi- 
mum of eight of ten locations. Gonsequentiy. the other 
two locations are reserved exclusively for the request 
35 packets. , . , 

Therefore, with the settings of 8/8/1 0, two locations 
/ are always reserved for requests, two locations are^ 
, _ always reserved for responses, and the remaining six 
locations can be used by for either responses or 
40 requests, as needed by the system. 

A technical feature of the invention is to use a single 
taiffer for tx)th request buffer space and response buffer 
space. 

Another technical feature of the invention is to 
45 reserve some buffer space for only requests, to reserve 
some buffer space for only responses, and to use the 
remainder of the buffer space for- requests or 
responses, as required by the system. 

A further technical feature of the invention is to use 
50 counters to keep trade of the status of the different types 
of buffer spaces, i.e. request, response, and total. . 

A further technical feature of the i nvention is to have 
the allocation of the different types of buffer spaces soft- 
ware adjustable. 
55 - The foregoing has outiined rather broadly the fea- 
,tures and technical advantages of the present invention 
_ , in order that the detailed description of the invention that 
follows may be better understood. Additional features 
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and advantages of the invention will be described here- 
inafter which form the subject of the daims of the inven- 
tion. It should be appreciated by those skQIefd in the art 
that the conception and the specific enrtoocfiment dis- 
closed may be readily utilized as a basis for modifying or . 5 
designing other structure for carrying out the same 
purposes of the present invention. It should also be real- 
ized by those skilled in the art that such equivalent con- ' 
structions do hot depart from the spirit and scope of the ' 
invention as set forth in the appended claims. 10 

BRIEF DESCRIPTION OF THE DRAWINGS 

For a more complete understanding of the present 
invention, and the advantages thereof, reference is now is 
made to the following descriptions taken in conjunction 
with the accompanying drawings, in which: 

FIGURE 1 d^icts a single* node of a fTiuHi-nodei 
multi-pr6cessor system that uses the inventive 20 
tjuffer allocation; 

FIGURE 2 depicts the arrangement of a 'single 
cross bar router chip (RAC); 
FIGURE 3 depicts the arrangement of a cross bar 
arbiter within' the RAG. * = ^ 25 

DESCRIPTION OF THE PREFERRED EMBODI- 
MENTS - .^ * ' * 

FIGURE 1 depicts a single node of a multi-node. 30 
muftiisrcfcessor computer system'. The overall system 
may have a plurality of these nodes shown in FIGURE 

Each hode, in the emBodiment shown, can support 
up to sixteen processors 11 dr. These processors aire ' 35 
connected to processor agent chips or PACs 111. The 
function of each PAC 111 is to transmit requests from its 
associated processbrs 110 through cross bar router 
chips (RAC) 112 to the memory access chips (MAC) 
113 and then forward the responses back to the io 
requesting processor. Each PAC 'l 1 1 has an input/out- ' 
put (I/O) subsystem 117. Each MAC 113 corrtrols ■ 
access to its assodat&l coherent memory 114. \Nhen 
a. processor 110 generates a request to access mem- 
ory (or other resource), the associated PAC 1 1 1 sends is 
the request through the proper RAC 1 12 to a MAC 113. 
If the request is destined for memory 114 on the local 
node. MAC 113 accesses the memory attached to it If 
the request is destined for memory on another node. 
MAC 1 13 fonA/ards the request to TAC 115. TAG 1 15 is so 
the interface between the node and an SCI ring 116. 
TAC 1 1 5 is also known as a toroidal access chip or a 
SCI controller. The SCI rings 116 (not shown) intercon- 
nect the nodes in the multi-node system. 

FIGURE 2 dqaicts a single RAC chip 112 where- ss 
through packets flow from PAC -111 to MAC 113. and 
from MAC 1 1 3 to PAC -111. RAC chip 1 1 2 forms a ctoss- * 
bar which routes requests and responses in t>oth direc-' - 



tions- RAC has four PAC input txjffers 201, one buffer 
associated with a respective PAC. The RAC also has 
four MAC input buffers, one buffa* associated with a 
.respective MAC. Each of the PAC input buffers 201 can 
hold up to ten packets of both response and request 
types- 

PAC 111 sends a packet to the RAC 112. and 
where it is held in the PAC input buffer 201 until only 
RAC can send the packet on to MAC 113. Similarly for 
the opposite direction, then the MAC tries to tend a 
packet to a certain PAC. it goes through the proper RAC 
where it first goes into the input tjuffer on that RAC and 
is later on fonwaided on to. the proper PAC. The MAC 

' input buffers are not part of this invention. The PAC 
input buffers will hereinafter be refared to as buffers, 
the processors will make requests for data .and 

' provide responses for data. For example, there 'is a time 

' when processor 110 will make a request for a certain 
line menrtory. That request will be forwarded to PAC 111. 
PAC 111 fonwards it to. one of the two RACs 112 to 
which it is connected, and then to the MAC chip 113 that 
actually has the requested memory line 1.14. When PAC 
111 sends the request to RAC 112, the request is stored 
in buffer 201, and is later fonvarded to the MAC 113. If 
the request if to'a local memory line. MAC will access 
the memory line from memory and send it back as a 
response, to the original PAC. 

There are also times when processor 110 requests 
a line memory arid the line memory is cached in another 
processor. In that case, the request goes from PAC 111 
to MAC 113, and MAC 113 determines that the line 

' riiemory is at a remote location, and then sends a 
request to a second PAC to access the line memory, 
the second PAC routes the request to the appropriate 
processor, and then sends back the data from the niem- 
ory line as a response. So in this scenario, the PAC 
sends response' across the RAC to the MAC, where it is 
eventual delivered to the first PAC wherein the request 
originated.' 

FIGURE 3 depicts tiie arrangement of cross bar out 
arbiter 301 that resides in PAC 111. Arbiter 301 , in con- 
junction with buffer 201 performs the dynamic buffer 
allocation. Each art^iter 301 has three counter registers 
302. 303. 304. Counter 302 is a request counter that 
keeps track of the total nurnber of requests that are cur- 
rently in use in buffer 201 . Counter 303 is the response 
counter that keeps track of the total number of 
responses that are currently in use in buffer 201 . Coun- 
ter 304 is the total courtter that keeps track of the total 
number of buffer locations currentiy allocated for use by 
txrth responses and requests in buffer 201 . All three of 
these counters are used as inputs to cross-bar out arbi- 
ter 301. 

The other tivee inputs to arbiter 301 are the buffer 
' limits, which'can. if desired, be set by software. Request 
limit 307 indicates the maximum txiffer space that can 
*be used for reqCife^- packets. Response limit 308 indi- 
"cates the maximum buffer space that can be used for 
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response packets. Limit 309 irxiicates the total size of 
the buffer or the total' amount of both response and 
request packets that the buffer will hold. These three 
limits cause registers that can be written by the system 
to allow the limits to be flexible and changeable by soft- 
ware commands. 

Cross-bar arbiter 301, using data from the limit 
inputs and the counter'inputs*. decides whether to send 
a request packet or to send a response packet. Sup- 
pose, the PAC has an output directing the dispatch of a 
request packet and another output directing the dis- 
patch of a response packet. If the request limit is greater 
than the request couriter and the total limit is greater 
than the total counter, then it is safe to send the request 
packet. If the response limit is greater than the response 
counter and the total limit is greater than the total coun- 
ter, then it is safe to send the response packet. How- 
ever, if one of these groiips of conditiohb is not met, then 
the PAC can send the packet with the conditions that are 
met, thus preventing packet jams. Moreover, when the 
total limit is the same or less than the total courrter. the 
buffer is full and both packets would be held until the 
RAC sends information stating thatit has freed up some 
of the buffer locations. 

The buffer space reserved for request packets is 
the total limit minus the response limit, for the 8/8/10 
arrangement discussed previously. This arrangerhent 
yields a reserved request space of two. The buffer 
space reserved for response packets is the total limit 
minus the request limit, for the 8/8/10 arrangement dis- 
cussed previously, yielding a reservied response spac^ 
of two. The remainder of the buffer space, called the 
dynamic space is equal to the response limit plus the 
request limit minus the total limit, for the 8/8/1 0 arrange- 
ment discussed previously and yields a dynamic space 
of six. , 

Arrangements other than 8/8/10 are possible, how-' 
ever, at least one space should always be reserved for 
response packets. Otherwise', rf the buffer were to be' 
filled up with requests, no response packets could be 
moved, and the system could be dead-locked. .This 
would yield a 9/10/10 arrangement i.e. a maximum of 
nine requests, a maximum of ten responses, of a ten 
packet txjffer. The 9/10/10 system is the most dynamic 
system, with a dynamic space of nine. The least 
dynamic system would be a 5/5/10. with a dynamic 
space of zero, this effectively is the prior art arrange- 
ment. 

The 8/8/10 arrangement means at least a total of 
four spaces are reserved, and six spaces are dynamic. 
This arrangement has been found to work well at keep- 
ing the packets moving. A 7/7/10 is less flexible, with 
only four locations that are dynamically allocable, 
because three spaces are reserved for requests and 
three more for responses, . 

Whenever the PAC sends a request packet, it incre- 
ments both the request counter and the total counter. 
The RAC stores the request until the MAC is ready to 



receive it. When* the request packet is sent on to the 
MAC. then the RAC sends a count of the actual number 
of requests that have been sent out to the MACs. When 
the PAC receives this count information from the RAC, 
5 the PAC subtracts the number of sent out packets from 
both the request counter and the total counter. Thus, the 
counters are decremented when the RAG sends out the 
request. 

Whenever the PAC sends a response packet it 

10 increments both the response counter and the total 
counter. The RAC stores the response until the MAC is 
ready to receive it. When the response packet is sent on 
to the MAC. then the RAC sends a count of the actual 
nunrtber of responses that have been sent out to the 

15 MACs. When the PAC receives this count information 
from the RAC, the PAC subtracts the number of sent out 
packets from t>6th the response counter and the total 
counter. Thus, the counters are decremented when the 
RAC sends out the response. 

20 The RAC has no knowledge of the actual allocation 
of the buffer space between request arxJ response. The 
RAC knows that it is* receiving requests and responses 
packets, and the numljer of packets of each type being 
sent out- The RAC sends this information back to the 

25 PAC. Buffer 201 is ideally physically one buffer and is 
broken up logically into request buffers and responses 
buffers. However, the system coiild work 'with multiple 
physical registers. The RAC knoWs which buffer loca- 
tions are which, and which buffer locations currently 

30 contain responses, but it does not receive information 
atxjut the counters and limits in the PAC. 

Although the* present inyertbon and its advantages 
have been described in detail, it sfK»uld be understood 
that various chianges. substitutions and alterations can 

35 be made herein without delisting fronfi the spirit sind 
scope of the invention as defined by the appended 
claims. 

Claims 

40 " 

1 . A computer system having a processor that creates 
request packets to access mevnory and creates 
response packets that satisfy queries made by the 
system., the system comprising: 

45 ' 

a dynamically allocable buffer space for storing 
and routing the request packets and the 
response packets; and ^ 
an arbiter for receiving the request packets and 
50 the response packets from the processor, and 

for making determinations whether to send the 
request packets and the response packets to 
the buffer space. . 

55 2. The computer system of clairn 1. wherein said 
. determinations .are made inpart, under control of: 

a request counter tfiat stores a number value 
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representing a total nunrtoer of request packets 
currently stored in the buffer space; . . i 
a response counter that stores a number value 
representing a total number of response pack- 
ets currently stored in ttie buffer space; 5 
a total counter that stores a nun^er value rep- 
resenting a total nurrtoer of both request pack- 
ets and response packets currently stored in 
the buffer space; 

a request limit value representing a maximum io 
number of request packets that can be stored 
in the buffer space; 

a response limit value representing a maximum , 
numljer of response packets that can be stored 
in the buffer space; and is 
a total limit value representing- a maximum 
number of both request packets and response 
packets that can be stored In the buffer space. ^ 

3. The conputer system of claim 2, wherein: 20 



the arbiter compares the request lirnh with the . 
request counter and compares the total limit 
with the total counter to determine whether to 
serxi the request packet to frie buffer space; ^ 25 
if the request limit is greater than the request 
counter, and if the total limit is greater than the 7. 
total counter, then the arbiter will send the 
request packet and increment both* the request 
counter and the tcrtal counter; . . . . 30 
if the request limit is not greater than the .... 
request counter, tiien the arbiter compares the . 
response limit with, the response counter and 8. 
corrfpares the total lirnit with the total counter to 
determirie whether to' send the response 35 
packet to the buffer space; arid 
if the total linnit Is hot greater than the total 
counter, then the arbiter will send neither the 
request packet nor the response packet until 
the total limit is greater than the total counter. 40 

4. The comfxiter system as in daim 2 or 3, wherein : 

the buffer space transmits the request packet 
to a location in the system after receiving the 45 9. 
rjequest packet from the arbiter, and then relays 
transmission informafibn to the arbiter; and 
the arbiter receives the transmission informa- 
tion and decrements the request counter and 
the total counter. so 

5. The computer system of claim 2. wherein: 

the arbiter compares the response limit with the 
response counter and compares the total limit ss 10. 
with the total counter to deterrhine whether to 
send the response packet to the buffer space; 
rf the response limit is greater than the 



response counter, and if the total limit is greater 
than the total counter, then the arbiter will send 
the response packet and increment both the 
response counter and the total counter; 
if the response limit is not grater than the 
response counter, then the arbiter conpares 
the request limit with the request counter and. 
compares the total limit with the total counter to 
determine whether to send the request packet 
to the buffer space; and 
if the total limit is not greater than the total 
counter, then the arbiter will send neither the 
request packet nc^r the response packet until 
the total limit is greater than the total counter. 

, 6. The computer system as in. claim 2 or 5, wherein: 

the buffer space transmits the response packet 
to a location in the system after receiving the 
response packet from the arbiter, and then 
relays transmission infprmatipn to the arbiter; 

and ... 

the arbiter receives the transmission informa- 
tion and decrements the response counter and . 
the total counter . 

7. The computer system as in one of claims 2-6. 
wherein: 

the request lirriit. the response limit, and the 
total limit are alterable by software commands. 



The computer system a^ in one of claims 2-7. 
wherein: " . 

a reserved request ^Dace of the buffer space is 
equal to the total limit minus the response limit; 
a reserved response space of the buffer space 
is equal to the total limit minis the request limit; 
and V , 

a dynamically allocable space of the buffer 
space is equal to the request limit plus the 
response limit mirius the total limit. 

The computer system as in one of claims 2-8, 
wherein: 

the request limit is 8; 

the response Gmit is 8; 

the total limit is 1 0; 

the reserved request space is 2; 

the reserved response space is 2; and 

the dynamically allocal^le SF>ace is 6. 

A method for dynamically allocating buffer space in 
a computer system, the method comprising the 
Steps of: 
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accepting request packets to access a mem- 
ory; 

accepting response packets ttiat satisfy que- 
ries made by the system; 

determining whether to send the request pack- s 
ets and the response packets to the buffer 
space; 

dynamically aliocating the buffer space so as to 
accornmiDdate a maximum number of . both, 
request packets and response packets; and 10 
storing the request packets and the response 
packets in the buffer space. 
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for responses only, and space that can be used for either 
requests or responses, i.e., dynamically usable as 
needed by the system. An arbiter 301 uses three regis- 
ters to keep track of the request buffer space 302, the 
response buffer space 303 and the dynamic space 304. 
The arbiter compares each of the registers with a cor- 
responding limit 307, 308, and 309, to determine if a re- 
quest packet or a response packet should be sent to the 
buffer. The limits are set by software and define the max- 
imum number of request products, response packets, 
and total number of packets the buffer can hold. For ex- 
ample, the limit may be set to eight requests, eight re- 
sponses and ten total. Thus two spaces are reserved 
for requests and two spaces are reserved for responses, 
and six are dynamically usable. 
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